
APPARATUS AND METHOD FOR CONTROLLING A MASTER/SLAVE 
SYSTEM VIA MASTER DEVICE SYNCHRONIZATION 

Brief Description of the Invention 

This invention relates generally to master/slave digital systems. More 
particularly, this invention relates to a master/slave digital system in which the master 
device performs all synchronization operations, thereby obviating the need for phase 
5 alignment circuitry on slave devices. 

Background of the Invention 

Synchronous bus systems have been developed to support higher bandwidth 
digital systems. In a synchronous bus system, data packets are sent between a master 

10 device and one or more slave memory devices. The data packets travel in parallel 
with a system clock and maintain a precise phase to that clock. Typically, 
synchronous memory systems utilize a phase adjusting circuit on the master and on 
each one of the slave devices in order to align the on-chip receive and transmit clocks 
to maximize the reception timing margins both on the master and slave devices. U.S. 

15 Patents 5,432,823 and 5,513,327, which are assigned to the assignee of the present 
invention and which are expressly incorporated by reference herein, describe 
synchronous memory systems with phase adjusting circuitry on both master and slave 
devices. 

The primary disadvantage of using phase alignment circuitry in every slave 
20 device of a master/slave system is that it increases power dissipation. For example, if 
the "standby" power of phase alignment circuitry, such as a Delay Locked Loop, is 
125 mW, a system with 32 slave devices on standby mode will dissipate a total of 4W. 
This power dissipation is prohibitively large for emerging portable applications. 

In view of the foregoing, it would be highly desirable to provide a master/slave 
25 system in which the slave devices do not require phase alignment circuitry, thereby 
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allowing the master/slave system to operate with substantially reduced power 
dissipation. 

Summary of the Invention 
5 A method of operating a master/slave system includes the step of identifying a 

master receive data phase value to coordinate the transfer of data from a slave device 
without phase alignment circuitry to a master device with a universal phase aligner. 
Data is transferred from the slave device to the master device in accordance with the 
master receive data phase value. The master device characterizes a master transmit 

1 0 data phase value to coordinate the transfer of data from the master device to the slave 
device. Subsequently, the master device routes data to the slave device in accordance 
with the master transmit data phase value. 

A master/slave system includes a set of slave devices, with each slave device 
having a clock circuit without phase alignment circuitry. A master device with a 

1 5 universal phase alignment circuit includes a phase value register bank storing a set of 
phase values for the set of slave devices. The master device utilizes a selected phase 
value of the set of phase values to alter a system clock signal in accordance with the 
selected phase value so as to establish synchronous communication between the 
master device and a selected slave device of the set of slave devices. 

20 The invention provides a master/slave system in which the slave devices do not 

require phase alignment circuitry. Thus, the master/slave system operates with 
substantially reduced power dissipation. The apparatus of the invention can be 
operated in a high throughput mode wherein each slave device is assigned a first phase 
delay value for data transmission and a second phase delay value for data reception. 

25 In a simpler, lower throughput mode, individual phase delay values are not stored. 
Instead, the master designates different slave devices to process different data edges, 
depending upon the phase information the master device receives during a calibration 
operation. 

30 Brief Description of the Drawings 
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For a better understanding of the invention, reference should be made to the 
following detailed description taken in conjunction with the accompanying drawings, 
in which: 

FIGURE 1 illustrates a master/slave system constructed in accordance with an 
5 embodiment of the invention. 

FIGURE 2 illustrates a universal phase aligner, in accordance with an 
embodiment of the invention, for use in the system of Figure 1 . 

FIGURE 3 illustrates a master receive data calibration method executed in 
accordance with an embodiment of the invention. 
10 FIGURE 4 illustrates a master receive data operation executed in accordance 

with an embodiment of the invention. 

FIGURE 5 illustrates a master transmit data calibration method executed in 
accordance with an embodiment of the invention. 

FIGURE 6 illustrates a master transmit data operation executed in accordance 
1 5 with an embodiment of the invention. 

Like reference numerals refer to corresponding parts throughout the drawings. 

Detailed Description of the Invention 

Figure 1 illustrates a master/slave system 20 constructed in accordance with an 

20 embodiment of the invention. The system 20 includes a master device 22 and a set of 
slave devices 24. Figure 1 illustrates a single slave device 24 for the purpose of 
simplicity. However, it should be appreciated that the system 20 includes many slave 
devices 24, each of which is configured as the device shown in Figure 1 . The master 
device 22 includes a universal phase aligner 30, which operates to coordinate the 

25 phase alignment in all communications between the master device 22 and the slave 
devices 24. In other words, the universal phase aligner 30 obviates the need for phase 
alignment circuitry in each of the slave devices 24. In the absence of phase alignment 
circuitry in each of the slave devices 24, the master/slave system 20 dissipates a 
relatively small amount of power. 

30 Figure 1 illustrates that the master device 22 includes logic and control 

circuitry 32. The operation and implementation of the logic and control circuitry 32 is 
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independent of the invention, since the universal phase aligner 30 may be used with 
practically any type of logic and control circuitry 32. 

In the embodiment of Figure 1, the universal phase aligner 30 produces a 
system clock (SYSCLK) on line 33. In an alternate embodiment, the system clock on 
5 line 33 is generated by a separate clock source that is external to the universal phase 
aligner 30. The universal phase aligner 30 also produces a master request clock 
(MRQCLK) signal, which controls the timing of an output buffer 34. The output 
buffer 34 receives slave access request control and data signals from the logic and 
control circuitry 32. The output buffer 34 applies the signals to a request bus 40. An 
10 output buffer 34 is provided for each line of the request bus 40. 

The universal phase aligner 30 also produces a master receive data clock 
..^ (MRDCLK) signal, which is used to control a latch 36, which may be a flip-flop. The 

*.3 latch 36 latches data from the data bus 42. The data bus 42 carries data to and from 

\r\ the master device 22 (i.e., the data bus 42 is bi-directional). A latch is provided for 

i'S 15 each line of the data bus 42. 

; iaa s As discussed below, the MRDCLK signal is a phase adjusted system clock 

» signal for a selected slave device 24. The MRDCLK signal is synchronized to data 

; 1 received from the selected slave device of the master/slave system 20. The MRDCLK 

v 2 signal accounts for the time between when the system clock is used to launch data 

w 20 from a slave device 24 and the time that the data is received at the master device. By 

accounting for this time difference, expressed as a phase difference between the 
system clock and the MRDCLK, the slave device 24 and the master device 22 can 
exchange data in phase. 

The universal phase aligner 30 also produces a master transmit data clock 
25 (MTDCLK) signal, which controls the timing of an output buffer 38. The output 

buffer 38 drives data from the logic and control circuitry 32 onto the data bus 42. An 
output buffer 38 is provided for each line of the data bus 42. 

As discussed below, the NlTDCLK signal is a phase adjusted system clock 
signal for a selected slave device 24. The MTDCLK signal is synchronized to 
30 transmit data, such that the selected slave device of the master/slave system 20 

receives the data in phase with the system clock. The MTDCLK signal accounts for 
the time between launching data from the master 24 to the time that it is received at 
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the slave 24. This time difference, expressed as a phase difference between the 
system clock and the MTDCLK, all >ws the master device 22 to launch data in a 
manner such that it is received at a selected slave device 24 in phase with the system 
clock. I 
5 Figure 1 further illustrates a side band (SB) bus 46 connected to the logic and 

control circuitry 32. The side band bus 46 may be implemented as a slow bus, such as 
a daisy chain, which carries control signals that are not time critical. 

Each slave device 24 of the master/slave system 20 includes a clock circuit 50 
and a set of logic and control circuitry 52. The clock circuit 50 does not include any 

10 type of phase alignment circuitry. Instead, it is merely implemented with an 

amplifier, a buffer, and duty cycle correction circuitry, if necessary. The amplifier 
and buffer may be used to amplify and buffer the system clock (S YSCLK) received 
from line 33. The amplified and buffered system clock (SCLKO) is used to gate a 
request bus flip-flop 54, a data bus input flip-flop 56, and a data bus output buffer 58. 

15 The request bus flip-flop 54 is attached to the request bus 40, while the data bus input 
flip-flop 56 and the data bus output buffer 58 are connected to the data bus 42. Flip- 
flops 56 and output buffers 58 are provided for each line of the request bus 40 and 
data bus 42. The implementation of the logic and control circuitry 52 is inapposite to 
the invention, as a variety of logic and control functions may be implemented in 

20 accordance with the synchronization scheme of the invention. 

The operations performed by the components of Figure 1 are more fully 
appreciated with reference to Figure 2, which illustrates a universal phase aligner 30 
constructed in accordance with an embodiment of the invention. 

The universal phase aligner 30 may include a standard system clock source 60. 

25 Alternately, an external system clock may be used. The system clock signal 

(S YSCLK) is applied to line 33 of Figure 1 and to a phase rotator 62. As its name 
implies, the phase rotator 62 operates to rotate the phase of the system clock 
(S YSCLK) in accordance with a value specified by a phase error signal. The phase 
rotator 62 may be implemented with a phase-locked loop, a delayed lock loop, or 

30 equivalent device known to those skilled in the art. 

The output of the phase rotator 62 is an adjusted system clock signal, which 
may be the previously described master request clock (MRQCLK), master receive 
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data clock (MRDCLK), or the master transmit data clock (MTDCLK). The adjusted 
system clock signal is applied to a flip-flop 64, which also receives a clock signal 
from a selected slave device. In particular, a signal is received at latch 36 of Figure 1. 
A synchronizer 66 operates to stabilize the output from the flip-flop 64. The 
5 synchronizer 66 may be implemented as a series of flip-flops. 

A counter 68 is used to quantify the phase difference between the adjusted 
clock signal and the clock signal received from the selected slave device. For 
example, the counter 68 may be implemented to count digital values read from the 
flip-flop 64. In such an embodiment, the flip-flop 64 may be timed by a digital high 

10 adjusted clock signal. A digital high output from the flip-flop 64 indicates that the 
clock signal from the slave device is at a digital high value at the same time as the 
adjusted clock signal. Thus, the signal from the slave device has arrived at the flip- 
flop 64. On the other hand, a low output from the flip-flop 64 indicates that the clock 
signal from the slave device has not been received at the flip-flop 64. In such a case, 

1 5 the counter is incremented to adjust for the phase error. When phase alignment is 
achieved, the counter 68 stores a counter value indicating the phase difference 
between the adjusted clock signal and the clock signal received from the selected slave 
device. 

At this time, a controller 70 reads the counter value in the counter 68. The 
20 phase difference value read from the counter is then stored in a phase value register 
bank 74. In particular, the phase difference value is stored in a register corresponding 
to the selected slave device. Recall that the stored value specifies a phase value 
adjustment to the system clock that will match the clock signal received from the 
selected slave device to the adjusted clock signal. Since the signals are matched, each 
25 signal transition will occur at the same time. This may result in metastability 

problems as the signals are latched into different devices. To avoid this problem, an 
offset adder 72 is used to add an offset value to the phase value received from the 
counter 68. The offset improves data reception timing margins. By way of example, 
the offset may be 90 degrees. 
30 Later when the master 22 communicates with a selected slave device 24, the 

phase value for the selected slave device is retrieved by the controller 70 from the 
phase value register bank 74. The controller 70 then loads the value into the counter 
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68 and disables the counter. Thus, the same phase value is applied to the phase rotator 
62 with every clock cycle. This results in a phase rotation of the system clock 
(SYSCLK) to correspond to the specified phase value. 

Each of the components of Figure 2 has now been described. Attention 
5 presently turns to a more detailed discussion of the operation of the components of 
Figure 2 in the context of the system of Figure 1 . 

As previously indicated, the master/slave system of Figure 1 does not include 
clock phase alignment circuitry in the slave devices 24. Instead, all clock phase 
alignment is performed by the master device 22 for each slave device in the system 
10 20. The master device 22 initially performs two calibration operations: a master data 
receive calibration and a master transmit data calibration. The master data receive 
calibration operation is performed to identify the phase value to be used for receiving 
data from a selected slave device. The phase value obtained from the master data 
receive calibration is used during a master data receive operation. More particularly, 
1 5 the phase value is used to adjust the system clock so that a receiver of the master 
device 22, operating in response to the adjusted system clock, receives data in phase 
from a selected slave device, which is operating in response to the system clock. 

The master transmit data calibration operation is performed to identify the 
phase value to be used to transmit data to a selected slave device. The phase value 
20 secured during the master transmit data calibration operation is used during a master 
transmit data operation. That is, the phase value is used to adjust the system clock so 
that an output buffer of the master device 22, operating in response to the adjusted 
system clock, transmit data that is in phase with the system clock as received at the 
selected slave device. 

25 The master data receive calibration operation 80 is disclosed in connection 

with Figure 3. Initially, a selected slave device applies an alignment signal to the data 
bus (step 82). This step may be initiated by a command on the side band bus 46. 
Thus, for example, the logic and control circuitry 32 of the master device 22, may 
apply a signal to the side band bus 46, which causes the selected slave device 24 to 

30 generate a periodic signal that is applied to the data bus 42. By way of example, the 
periodic signal may be a sequence of on and off digital pulses (i.e., a clock signal) 
generated by the logic and control circuitry 52 and applied to the bus 42 via buffer 58. 
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The master device 22 receives these signals at latch 36. Latch 36 is controlled by the 
clock signal MRDCLK. Initially, the MRDCLK signal is simply the system clock 
S YSCLK (the phase rotator 62 has not applied a phase error to the system clock). The 
universal phase aligner 30 subsequently aligns the MRDCLK signal with the 
5 alignment signal received from the selected slave device (step 84). As previously 
discussed, the counter 68 quantifies the phase error between the MRDCLK signal and 
the alignment signal. When alignment is achieved, the counter 68 holds the phase 
value that achieved alignment. Thus, the next processing step can be characterized as 
the master quantifying the phase value between the system clock and the MRDCLK 

10 clock (step 86). 

The next processing step of Figure 3 is to have the master apply a margin 
offset to the phase value (step 88). As discussed in connection with Figure 2, an 
offset adder 72 is used to add an offset to the phase value to improve the processing 
margin and thereby avoid metastability problems. The final processing step shown in 

1 5 Figure 3 is to have the master store the receive phase value for the selected slave 
device (step 90). In other words, the phase value determined by the calibration 
operation 80 is stored in a register of the phase value register bank 74, the selected 
register corresponding to the slave device that was just calibrated. 

A decision is then made to determine whether all of the slave devices have 

20 been calibrated (step 92). If not, a calibration operation for the next slave device in 
the system is initiated with step 82. Once all of the slave devices in the system have 
been calibrated to obtain a data receive phase value, the calibration operation 80 is 
completed. 

Figure 4 illustrates a master data receive operation 100 in accordance with an 
25 embodiment of the invention. If the master device 22 is to receive data from a 

selected slave device, then the controller 70 retrieves the phase value for the selected 
slave device from the phase value register bank 74. The controller 70 then passes the 
phase value to the counter 68 and disables the counter. As a result, the counter 
continuously applies the established data receive phase value to the phase rotator 62. 
30 Accordingly, the phase rotator 62 rotates the system clock (SYSCLK) to produce a 
phase adjusted MRDCLK signal that is appropriately aligned (with offset) with the 
incoming data from the selected slave device. These operations can be summarized 
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with the processing steps shown in Figure 4. In particular, a first step entails 
controlling the phase rotator 62 in accordance with the data receive phase value for the 
selected slave device (step 1 02), and a second step of receiving data from the selected 
slave device in response to the MRDCLK signal produced by the phase aligner (step 



The master transmit data calibration operation 1 10 of the invention is disclosed 
in connection with Figure 5. The first processing step of Figure 5 is to have a selected 
slave device 24 route request bus data to the data bus 42 as re-routed request bus data 
(step 1 12). That is, the master device 22 advises the slave device 24, for example via 
a signal on the side band bus 46, to re-route request bus data in the form of a set of 
digital high and low signals forming a clock signal. v The logic and control circuitry 52 
of the slave device 24 then re-routes the data from the latch 54 to the buffer 58. 

The master device 22 then aligns the master request clock (MRQCLK) until it 
corresponds to the re-routed request bus data at the slave device 24. Observe that in 
this case, the phase difference created by the distance on the data bus 42 from the 
selected slave device 24 to the master device 22 is accounted for by the data receive 
phase value for the selected slave device, which was obtained during the master data 
receive calibration. Thus, the present calibration operation measures the phase 
difference created by data traveling from the master device 22 to the selected slave 
device 24. That is, the present calibration operation measures the phase difference 
created by data traveling on the request bus 40 between the master device 22 and the 
selected slave device 24. 

In sum, the master device 22 aligns the MRQCLK signal with the re-routed 
request bus data to form a master transmit phase aligned MRQCLK signal for the 
selected slave device (step 1 14). As previously discussed in connection with Figure 2, 
the value in the counter 68 is adjusted until the a phase error signal from the counter 
allows the phase rotator 62 to phase rotate the system clock (SYSCLK) until it is in 
phase with the re-routed request bus data. 

After alignment is achieved, the master has quantified the phase value between 
the system clock and the master transmit phase aligned MRQCLK signal (step 116). 
The master then applies a margin offset to the phase value (step 1 1 8). As previously 
discussed, this is performed by the offset adder 72 to improve timing margins. The 



5 104). 
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master then stores the offset adjusted transmit data phase value for the selected slave 
device (step 120). If all of the slave devices have not been calibrated in this manner 
(step 122), then the processing of Figure 5 is executed for the next slave device in the 
system. If each of the slave devices has been calibrated to obtain a transmit data phase 
5 value, then the calibration operation is complete. 

The master transmit data operation 130 is executed in accordance with the 
method of Figure 6. The phase rotator is controlled in accordance with the transmit 
data phase value for the selected slave device (step 132). The transmit data phase 
value is the phase value that produced the MRQCLK signal that was aligned with the 
10 re-routed data. The phase rotator 62 processes the transmit data phase value to 

produce a master transmit data clock signal (MTDCLK). Data is then transmitted to 
the selected slave device in response to the MTDCLK signal produced by the phase 
.1 rotator (step 134). 

^ In sum, after the master data receive calibration operation of Figure 3 and the 

jf 15 master transmit data calibration operation of Figure 5 are performed, data is 

S exchanged between the master 22 and the slave 24 according to the master data 

receive operation of Figure 4 and the master transmit data operation of Figure 6. The 
;1 system needs to be re-calibrated according to the operations of Figures 3 and 5 

* periodically to compensate for drifts in operating conditions. 

3 20 Attention presently turns to the master device's transfer of communication 

~ between different slave devices 24. There is a latency associated with any such 

transfer of control. The factors that affect hand-over or switching latency include: (1) 

switching the phase of the master clock to the appropriate phase for the active slave 

and (2) informing a slave that it is currently active or inactive. The operation of 
25 switching the phase of the master clock to the appropriate phase for the active slave 

can be limited to below 1-2 cycles by pipelining techniques (e.g., switching the phase 

of the MRQCLK before switching the MRDCLK). 

There are a number of techniques that may be used to inform a slave that it is 

currently active or inactive. For example, a signal on the side band bus 46 may be 
30 used. While this is an acceptable solution in some systems (e.g., graphics memory 

subsystems), this solution might not be tolerable in systems where the master 
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addresses different slaves frequently (e.g., main memory systems with data 
fragmentation). 

Another technique for advising a slave device of its status is to use the request 
bus 40. In this embodiment, an additional "broadcast" bit is needed by the request bus 
5 40. To accomplish an "active hand-over", the master 22 transmits a "broadcast" 

packet on the request bus 40. This packet is transmitted with a pre-determined phase 
of the MRQCLK and is valid for a full clock cycle (i.e., two bit time periods). All the 
inactive slaves 24 continuously sample the data on the request bus 40 with both of 
their negative and positive clock edges. However, they always disregard the data 

10 sampled with one of the two edges according to their local phase. In other words, 

depending upon the position of the slave device in the physical system, it will sample 
one of the two edges of the data. Thus, the slave device is effectively operating at 
half-frequency during this state. The decision as to which datum to process and which 
to disregard is made by the master and is communicated to the slave during the 

15 calibration phase. Since the broadcast bit is valid for a full clock cycle, one of the two 
slave clock edges is appropriate for receiving the broadcast bit and the data on the 
request bus 40 with adequate timing margin. 

When the broadcast bit is a logical ONE, the master 22 broadcasts data on the 
request bus 40 specifying which slave is being invoked. Each slave 24 processes this 

20 broadcast data to determine if the broadcast data matches its specified identification 
value. If the comparison results in a match, the slave goes to the active mode. In the 
active mode, data on the request bus 40 and data bus 42 are processed at full speed 
(i.e., on both clock edges). 

An additional consideration for the active "hand-over" is the amount of time 

25 that the slaves might have to wait before they start transmitting data on the bus so that 
their data will not collide with the data transmitted by the previously active slave (or 
reflections of that data). This amount of time depends on the electrical length of the 
bus. 

The system of the invention can operate in a low power mode that is analogous 
30 to the active hand-over mode just discussed. In the low power mode, the phase of the 
master clock is pre-determined (e.g., it is locked to the system clock). Thus, the 
master does not change the phase of its clocks according to which slave is active. 
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Unlike the previous high bandwidth embodiment in which two bits of data are 
transmitted every clock cycle, in this embodiment only one datum is transmitted per 
clock cycle. 

During calibration for this mode, the master informs each slave which of its 
5 two clock edges to use when it receives and transmits data to and from the request bus 
40 and data bus 42. The master chooses the slave transmit clock edge that produces 
the maximum timing margin on the master's receivers. The maximum timing margin 
is identified during calibration. Since the phase of the normal receive clock of the 
master is pre-determined and the position of the slave's clock edges can be found in 

1 0 the calibration period, the identification of the maximum timing margin is simply 

based on a digital comparison at the master device 22. Similarly, the master device 22 
chooses the positive or negative edge of the slave's receive clock that results in the 
maximum timing margin on the slave receivers. Both the master transmit clock phase 
and the slave edge positions can be determined during the calibration period. 

1 5 The highest peak bandwidth achievable by this system is one half of the 

synchronous bus system that transmits two bits of data per clock cycle. Nevertheless, 
its lower power consumption and simpler control scheme make this embodiment 
appealing for selected applications. 

Those skilled in the art will recognize many alternate embodiments for the 

20 disclosed invention. For example, the invention was described in connection with a 
calibration technique that relied upon a clock signal. The calibration operation may 
also be performed with a random signal, as long as the universal phase aligner 30 can 
identify periodic signal edges associated with the random signal. 

The foregoing description, for purposes of explanation, used specific 

25 nomenclature to provide a thorough understanding of the invention. However, it will 
be apparent to one skilled in the art that the specific details are not required in order to 
practice the invention. In other instances, well known circuits and devices are shown 
in block diagram form in order to avoid unnecessary distraction from the underlying 
invention. Thus, the foregoing descriptions of specific embodiments of the present 

30 invention are presented for purposes of illustration and description. They are not 
intended to be exhaustive or to limit the invention to the precise forms disclosed, 
obviously many modifications and variations are possible in view of the above 
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teachings. The embodiments were chosen and described in order to best explain the 
principles of the invention and its practical applications, to thereby enable others 
skilled in the art to best utilize the invention and various embodiments with various 
modifications as are suited to the particular use contemplated. It is intended that the 
scope of the invention be defined by the following claims and their equivalents. 
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